A Rule Based Pronunciation Generator and Regional Accent Databank for Portuguese

نویسندگان

  • Simone Ashby
  • Sílvia Barbosa
  • Silvia Brandão
  • José Pedro Ferreira
  • Maarten Janssen
  • Catarina Silva
  • Mário Eduardo Viaro
چکیده

One of the major obstacles in deploying spoken language technologies (SLTs) in the developing world is a lack of key linguistic resources – e.g. electronic dictionaries, phonetically aligned corpora, pronunciation lexicons, etc. – that describe the non-dominant varieties spoken in such countries and regions. In this paper, we describe the work of the LUPo (Portuguese Unisyn Lexicon) project to model standard and non-standard varieties of spoken Portuguese from around the globe, and: (1) deliver a free, open-source tool for the automatic generation of accent-specific pronunciation lexica within the existing online lexical knowledge base, the Portal da Língua Portuguesa; and (2) provide the research and speech technology communities with a free, online, searchable database, the Portuguese RADbank, dedicated to the description of regional varieties of spoken Portuguese. Both resources are presented as bases for adapting SLTs to regional varieties spoken in the Luso-African and Luso-Asian world, as well as to non-standard varieties of Brazilian and European Portuguese.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pronunciation Rules in Portuguese Regional Speech (PORT REG) for Coarticulation Process

This paper describes one aspect of an ongoing work to incorporate pronunciation variability in the Portuguese (PORT) speech system. This work focuses on the linguistic rules to improve the grapheme-(multi)phone transcription algorithm that will be implemented. Portuguese ‘Beira Interior’ regional speech (PORT-BI REG) is considered to be in the realm of coarticulation (post-lexical) phenomena. A...

متن کامل

Regional accent familiarity and speechreading performance

The effect of accent [pronunciation of speech sounds determined by a speaker’s regional or national location] on auditory speech comprehension has been well documented, but research is lacking as to its effects on visual speech understanding. In order to address this, the present study examined the effect of regional accent variation on speechreading performance. The aim was to determine if fam...

متن کامل

The Generation of Regional Pronunciations of English for Speech Synthesis1

Welsh and Northern English), and two American ones (New York and South Carolina, to represent Eastern and Southern American); regional features were based primarily on the descriptions in [1], with native-speaker input where possible. The regional accents are abbreviated in this paper as: Br(Sc) = Edinburgh; Br(W) = Cardiff; Br(N) = Leeds; Am(E) = New York; and Am(S) = South Carolina. For the s...

متن کامل

Game-based Teaching of Stress Placement on Multi-syllabic English Words

Accurate pronunciation is an important component of language ability and the main outward linguistic sign of whether someone is a native speaker of a language or not. An area of particular difficulty for Persian-speaking learners of English, which may cause 'foreign accent' or misunderstanding in speaking, is placement of stress on multi-syllable words. Game-based pronunciation teaching can be ...

متن کامل

CrossTowns: Automatically Generated Phonetic Lexicons of Cross-lingual Pronunciation Variants of European City Names

The CrossTowns lexicons are part of a study that focuses on the phonetic variants that occur when speakers of different native languages (L1) with varying degrees of target language (L2) proficiency pronounce foreign city names. Based on a collection of speech data from this domain, it is one of the aims to identify the most common pronunciation errors in a particular L1/L2 pair (language direc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012